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Abstract 

A fundamental problem in wireless networks is the 
minimum spanning tree (MST) problem: given a set 
V of wireless nodes, compute a spanning tree T, so 
that the total cost of T is minimized. In recent years, 
there has been a lot of interest in the physical interfer- 
ence model based on SINR constraints. Distributed 
algorithms are especially challenging in the SINR 
model, because of the non-locality of the model. 

In this paper, we develop a fast distributed ap- 
proximation algorithm for MST construction in an 
SINR based distributed computing model. For an 
re-node network, our algorithm's running time is 
0(Dlogn + filogn) and produces a spanning tree 
whose cost is within O(logn) times the optimal 
(MST cost), where D denotes the diameter of the 
disk graph obtained by using the maximum possi- 
ble transmission range, and u = log denotes 
the "distance diversity" w.r.t. the largest and small- 
est distances between two nodes. (When is 
re-polynomial, p = O(logre).) Our algorithm's 
running time is essentially optimal (upto a logarith- 
mic factor), since computing any spanning tree takes 
Q(D) time; thus our algorithm produces a low cost 
spanning tree in time only a logarithmic factor more 
than the time to compute a spanning tree. The dis- 
tributed scheduling complexity of the spanning tree 
resulted from our algorithm is 0(/j. log n). Our algo- 
rithmic design techniques can be useful in designing 
efficient distributed algorithms for related "global" 
problems in wireless networks in the SINR model. 



1 Introduction 

Emerging networking technologies such as ad hoc 
wireless and sensor networks operate under inherent 
resource constraints such as power, bandwidth etc. A 
distributed algorithm which exchanges a large num- 
ber of messages and takes a lot of time can consume 
a relatively large amount of resources, and is not very 
suitable in a resource-constrained network. Also, the 
topology of these networks can change dynamically. 
Communication cost and running time is especially 
crucial in a dynamic setting. Hence it becomes nec- 
essary to design efficient distributed algorithms for 
various network optimization problems that have low 
communication and time complexity, even possibly 
at the cost of a reduced quality of solution. (For ex- 
ample, there is not much point in having an optimal 
algorithm if it takes too much time, since the topol- 
ogy could have changed by that time.) For this rea- 
son, in the distributed context, such algorithms are 
motivated even for network optimization problems 
that are not NP-hard, e.g., minimum spanning tree 
(MST), shortest paths (see e.g., [8]). However, much 
of the theory of distributed approximation for vari- 
ous fundamental problems such as MST have been 
developed in the context of wired networks, and not 
for wireless networks under a realistic interference 
model. 

In this paper we focus on one of the most fun- 
damental distributed computing problems in wire- 
less networks, namely the Minimum Spanning Tree 
(MST) problem. It is a recurring sub-problem in 
many network and protocol design problems, and 



there has been a lot of work on distributed algo- 
rithms for computing the MST. Computing an MST 
by a distributed algorithm is a fundamental task, as 
the following distributed computation can be car- 
ried over the best backbone of the communication 
graph. Two important applications of an MST in 
wireless networks are broadcasting and data aggre- 
gation. An MST can be used as broadcast tree 
to minimize energy consumption since it minimizes 
S(« v)£T d a (u, f). It was shown in [2,7,33] that 
broadcasting based on MST consumes energy within 
a constant factor of the optimum. 

Much of the traditional work on distributed al- 
gorithms [24, 30] has focused on the CONGEST 
model of message passing, in which a node can com- 
municate with all its neighbors in one time step. 
This model is more suited for wired networks, and 
the complexity of many fundamental problems (e.g., 
MST [11], shortest paths [4], etc.) is very well 
understood in the wired model. This model does 
not capture interference, an inherent aspect of wire- 
less networks, which causes collisions when "close- 
by" nodes transmit. The Radio Broadcast Net- 
work (RBN) Model (see, e.g., [1, 12]) was developed 
specifically to address such issues. This model is 
defined on a unit disk communication graph G = 
(V, E), and the transmission from a node u to its 
neighbor v is successful, provided no other neighbor 
w 7^ u,w G N(v) transmits at the same time. There- 
fore, nodes that can transmit simultaneously form 
some kind of independent sets, and this model has 
been studied extensively over the last two decades. 
It is known that the RBN model is significantly dif- 
ferent from the CONGEST model [12,30], and its 
"local" structure has been used in designing efficient 
algorithms for many fundamental graph problems. 

Though the RBN model is much closer model of 
wireless transmission than the CONGEST model, it 
still does not capture several crucial features [ 13,27]: 
even if a node v receives packets from two neighbors 
u and w, the transmissions are not necessarily lost 
— this depends on the ratio of the strength of the 
u's signal to all other noise being above a threshold. 
On the other hand, simultaneous transmissions by a 
number of "well-separated" nodes (which is possible 
in the RBN model) might be infeasible in reality if 
the above-mentioned ratio of signal strengths is vio- 
lated at some receiver. Therefore, the complexity of 



wireless transmission is not captured by the simple 
"local" constraints of the RBN model. A different 
model, called the Physical Interference model based 
on SINR (signal-to-interference & noise ratio) con- 
straints (henceforth, referred to as the SINR model) 
has been proposed to rectify the weaknesses of the 
RBN model [13, 27, 28]. The SINR model more 
accurately captures the physical nature of wireless 
networks and has been used in a number of recent 
studies (e.g., see the recent survey of Lotker and Pe- 
leg [23] and the references therein). However, un- 
like traditional (wired and RBN) models, distributed 
algorithms are especially challenging to design and 
analyze in the SINR model, because of the non-local 
nature of the model. In particular, to the best of our 
knowledge no prior distributed algorithms have been 
designed for "global" problems in the SINR model. 
Algorithms in the RBN model do not translate to ef- 
ficient algorithms in the SINR model, whose non- 
locality makes it much harder. 

Centralized algorithms for various fundamental 
problems, such as independent sets, coloring, domi- 
nating set, spanning tree have been developed in this 
model in recent years, e.g., [14, 18,34]. However, 
distributed algorithms which satisfy the SINR con- 
straints at each step are only known for very few 
problems (e.g., [10, 17, 19]). Furthermore, the dis- 
tributed algorithms known are for "local" problems 
such as independent set and coloring and not for 
"global" problems. Global problem are those that re- 
quire an algorithm to "traverse" the entire network. 
Classical "global" problems include spanning tree, 
minimum spanning tree, shortest path etc. Network 
diameter is an inherent lower bound for such prob- 
lems. We note that the known algorithms in the SINR 
model for spanning tree problem and the connectiv- 
ity problem [3,26, 29] are in the centralized setting. 

In this paper, we describe the first distributed algo- 
rithm in the SINR model for approximate MST con- 
struction. The SINR constraints are satisfied at all 
steps of the algorithm, which gives a spanning tree 
with cost O(logn), relative to the MST cost. The 
running time is within a polylogarithmic time of the 
lower bounds in the RBN model. We discuss our re- 
sults in more detail below. 

(1) The running time of our algorithm is 
0(D\ogn + /ilog?i), with high probability, where 
D is the diameter of the graph restricted by the 
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maximum range. This is optimal up to a polylog- 
arithmic factor, since computing any spanning tree 
takes tt(D) time. In particular, if the maximum 
power level is unconstrained, the running time is 
0(/ilogra). The spanning tree produced by our al- 
gorithm has a low distributed scheduling complex- 
ity (defined in Section 3.4) of 0(/i log n), i.e., the 
transmission requests on the edges can be scheduled 
distributedly in 0(/i log n) time steps, for any orien- 
tation of the edges. 

(2) Our main technical contribution is the adap- 
tation of the technique of "Nearest Neighbor Trees" 
(NNT) [?, ?, ?] to the SINR model. This technique 
results in spatial separation at each step, which helps 
in ensuring the SINR constraints. Our algorithmic 
design technique can be useful in designing efficient 
distributed algorithms for related "global" problems 
in wireless networks in the SINR model. 

All prior distributed algorithms in the SINR model 
have been restricted to scheduling kind of problems 
(e.g., independent sets, coloring and broadcasting). 
Our result is the first distributed algorithm for a prob- 
lem not in the above class, and might be useful in 
other network design problems. 

2 Preliminaries and distributed 
primitives 



Table 1: Notation. 



V 


set of nodes 


n 


#nodes 


D 


network diameter 


d(u, v) 


dist. between u, v 


a 


path-loss exponent 


P 


SINR threshold 


N 


background noise 


P 


transmission power 




distance diversity 


r 


transmission range 



Let V denote a set of tranceivers (henceforth, 
referred to as nodes) in the Euclidean plane, and 
d(u, v) the Euclidean distance between nodes u, v. 
Let d m i n and d max denote the smallest and the 
largest distances between any two nodes respec- 
tively. d m i n is a constant as a result of the dimen- 
sion of a node as a wireless device. We normalize 
the distances, such that d m in = l.We use D to de- 
note the diameter of the disk graph obtained based 
on V by using the maximum possible transmission 
range. We say a node u is in the ball B(v, r) of node 
v if and only if d(u, v) < r. Let P(u) denote the 



transmission power chosen by node u, with a max- 
imum power level possible, denoted by P m ax- We 
also assume that the nodes are capable of "adaptive 
power control", which means that they can transmit 
at any power level in the range [0,P max ]. We as- 
sume the commonly used path loss models [?, 6], in 
which the transmission from u to v is possible only 

if: W$Uj/ N ^ & where « > 2 is the "path- 
loss exponent", /3 > 1 is the minimum SINR re- 
quired for successful reception, and N is the back- 
ground noise (note that a, (3 and N are all constants). 
This inequality also defines under a certain trans- 
mission power P the transmission range, which is 
the threshold distance beyond which two nodes can- 
not communicate with each other, and which equals 
yP/JWfi). To reduce notational clutter, we will 
say that the transmission range r associated with a 
transmission power P is r = (P/c) 1 /", for a con- 
stant c. Let r max = (Pmax/c) 1 / a denote the max- 
imum transmission range of any node at the maxi- 
mum power level. W.l.o.g., we assume r max < d max 
in our algorithms. 

We say that a set S of nodes in the plane form a 
"constant density set" w.r.t. range r if there are O(l) 
nodes within the ball B(v, r) of any node v, where v 
is the center and r is the radius. A set S' C S is said 
to be a constant density dominating set for S w.r.t. 
range r, if S' is a constant density set, and for each 
v G S, there exists u G S' such that v G B(u,r). 
Given a set S of nodes and range r, we define 
G r (S) = (S,E = {(u,v) :u,ve S,d(u,v) < r}) 
to be the graph induced by S with range r. 
Wireless interference. We use physical interference 
model based on geometric SINR constraints (hence- 
forth referred to as the SINR model), where a set L 
of links can make successful transmissions simulta- 
neously if and only if the following condition holds 
for each / = (n, v) G L: 

^ « 

V y[ - u I 4- N 

l^u'eV\{u} d a (u',v) + iV 

where V is the set of transmitting nodes. Such a set 
L is said to be feasible in the context. 
MST-SINR: The minimum spanning tree prob- 
lem under the SINR model. Given a set V of wire- 
less nodes, the goal is to find a spanning tree T, such 
that the total cost of T is minimized, w.r.t. a cost 
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function cost(u, v) = d(u, v) for any pair of nodes 
(u,v); for a set E of edges, we define cost(T) = 
S( u v)eE cos t{ u : v )- We focus on developing dis- 
tributed approximation algorithms. We say an algo- 
rithm gives a 7-approximation factor if it constructs 
a spanning tree T, with cost(T) < 7 • cost(MST), 
where MST represents an optimum solution. In the 
problem we study here, we only require the tree to be 
constructed implicitly — we assume we have a sink 
node s, and we define a parent, par(v) for all nodes 
!//s, such that the set of edges {(v,par(v)) : v € 
V,v 7^ s} form a spanning tree. Each node only 
needs to know the identity of its parent. The goal of 
this paper is to design an algorithm for computing 
an approximate MST in the distributed computing 
model based on SINR constraints (described below); 
we do not require the transmissions (in either direc- 
tion) on all the edges in the tree to be simultaneously 
feasible in the SINR model. 

Distributed computing model under the SINR 
model. Traditionally, distributed algorithms for 
wireless networks have been studied in the radio 
broadcast model [5, 25, 32] and its valiants. The 
SINR based computing model is relatively recent, 
and has not been studied that extensively. Therefore, 
we summarize the main aspects and assumptions un- 
derlying this model below. 

(1) The network is synchronized and for simplic- 
ity we assume all time slots have the same length. 

(2) The graph G r . max / c {V) induced by V with 
range r max /c is connected, i.e., the graph can be 
connected by using a transmission power level of 
Pmax/c' which is slightly less than the maximum but 
within a constant factor. 

(3) All nodes have a common estimate of n within 
a n c for some constant c. 

(4) All nodes share a common estimate of d m i n = 
1 and d max , the minimum and maximum distances 
between nodes. We use u = log to denote the 
"distance diversity" (similar to the "link diversity" in 
[?]), which is the number of classes of similar length 
edges into which the set of all edges can be parti- 
tioned. It is common to assume that \i = 0(log n). 

(5) As mentioned earlier, we assume nodes are 
equipped to be able to do adaptive power control, 
i.e., each node v can transmit at any power level 

Pe[0,P m ax}. 



2.1 Distributed primitives 

We discuss two primitives that are needed in our 
MST algorithms for local broadcasting and dominat- 
ing sets. The results of [15,31] directly provide us 
efficient implementations for these two problems. 
1. Local broadcasting: The local broadcasting 
range rj is the distance up to which nodes in- 
tend to broadcast their messages. We say the 
local broadcasting from S to S' is successful if 
and only if for each node u € S with trans- 
mission power Pi,, all the nodes in S' within 
it's local broadcasting range r& receives the mes- 
sage from u. We assume we have an algorithm, 
Local-Broadcast^, S',rt,,P},), which takes sets 
S, S' of nodes, a distance r& and a power level P^ as 
input, and ensures that the local broadcasting from 
S to S' is successful. A small modification of the 
local broadcasting scheme of [15] achieves this step, 
which is described below. 

Lemma 2.1. Given two sets S, S' of nodes, a lo- 
cal broadcasting range rj, and a power level P^ = 
c ' r b> f or a constant d , there is a distributed algo- 
rithm LocalBroadcast(S, S', r^, P&) that runs in 
0(N(S, n /rb)log\S\) time where N(S,^rb) is the 
maximum number of nodes in S within a distance 
jrb of any node in S and 7 is a constant, such that: 
(i) each node v € S' receives the message from all 
nodes in S within distance w.h.p., and (ii) each 
node v £ S' is able to selectively ignore messages 
from any node u € S' which is beyond distance jr^- 

Proof sketch). We describe the protocol as follows. 
Following [15] we use random access, in which each 
node in S transmits at Power P& with prob. N ^g^ rb ^ 
(known to each node in S), and each node in S' 
senses the channel to receive messages. Each time 
a node w£S' receives a message, v checks the total 
power received; if that exceeds (zf^ra + N, v dis- 
cards the received message. 

Property (i) in Lemma 2.1 directly follows from 
the proofs in [15], by partitioning the space into rings 
and uppering bounding the stochastic interference. 
Then Property (ii) follows from the condition we put 
on accepting a message based on SP(v), such that 
any message sent from a node beyond distance 7^ 
from v will be ignored, where 7 is a constant. □ 
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2. Constant density dominating set: We assume 
an algorithm ConstDominatingSet(5, r c ) that 
takes as input a set S of nodes and a range r c < 
fmax, and produces a constant density dominating 
set S' <Z S corresponding to this range, such that for 
each node in S there is a node in S' which is with 
distance r c , and the density of the nodes in S' is at 
most a constant. By density, we mean the number 
of nodes in S' within a range r c of any node in S'. 
The algorithm in [31] serves this purpose, with the 
performance summarized below. 

Lemma 2.2 ( [31]). Given a set S of nodes and a 
range r c < r max , a constant density dominating set 
for S can be constructed in time 0(log n) w.h.p. un- 
der the SINR model. 

2.2 Nearest Neighbor Tree Scheme 

The algorithmic paradigm underlying our algorithm 
is the Nearest Neighbor Tree (NNT) scheme [?, ?, ?]. 
The NNT scheme can be used to construct a span- 
ning tree, called the Nearest Neighbor Tree (NNT), 
efficiently in a distributed fashion. The cost of the 
NNT can be shown to be within an O (log re) factor 
of the cost of the MST The scheme used to construct 
an NNT (henceforth called NNT scheme) consists of 
the following two steps: 

(1) each node first chooses a unique rank from a 
totally-ordered set; a ranking of the nodes cor- 
responds to a permutation of the nodes; 

(2) each node (except the one with the highest rank) 
connects (via the shortest path) to the nearest 
node of higher rank. 

It can be shown that the NNT scheme constructs 
a spanning subgraph in any weighted graph whose 
cost is at most O(logre) times that of the MST, ir- 
respective of how the ranks are selected (as long as 
they are distinct) [?, ?]. Note that some cycles can be 
introduced in step 2, and hence to get a spanning tree 
we need to remove some edges to break the cycles. 
The main advantage of the NNT scheme is that each 
node, individually, has the task of finding its near- 
est node of higher rank to connect to, and hence no 
explicit coordination is needed among the nodes. 

However, despite the simplicity of the NNT 
scheme, it is non-trivial to efficiently implement the 
scheme in an arbitrary weighted graph. The work of 



[?] showed how to efficiently implement the scheme 
in an arbitrary weighted graph in the CONGEST 
(wired) model. This implementation was shown to 
have a running time of 0(D(G) + L(G, w)) where 
L(G, w) is a parameter called the local shortest path 
diameter and D(G) is the (unweighted) diameter of 
the graph. This distributed implementation cannot be 
directly used here, due to the (additional) complica- 
tion of having to obey SINR constraints when each 
node tries to search for the nearest node of higher 
rank to connect to. Second, the local shortest path 
diameter can be significantly larger than the diame- 
ter of the underlying graph. In particular, it can be 
as large as n, in which case, the above implemen- 
tation does not give the time bound that we would 
like to show in this paper, i.e., close to the diameter 
of the underlying graph. Note that this is essentially 
the best possible, since computing a spanning tree 
on an arbitrary graph takes diameter time. To show 
this stronger bound, we exploit the fact that the un- 
derlying graph has a geometric structure. The main 
idea is to choose ranks in a particular way (in step (1) 
of the NNT scheme) that guarantees the each node 
can find its nearest node of higher rank fast. This 
idea was first implemented in the UDG-NNT (Unit 
disk graph-NNT) algorithm [?]. However, again this 
implementation does not directly work in the SINR 
model. We show that one has to implement both step 
(1) and step (2) of the NNT scheme in a incremen- 
tal and staged fashion, so that SINR constraints are 
obeyed and still many nodes can progress simultane- 
ously. We show that NNT technique results in spatial 
separation of "active" nodes (i.e., nodes that need to 
communicate) in each step, and hence is amenable 
to obey SINR constraints. The details are in the next 
Section. 

3 Approximating the MST in the 
SINR model 

We now discuss the algorithm MST-SINR for find- 
ing a low cost spanning tree. We start with the 
main intuition for the algorithm. The basic idea for 
satisfying SINR constraints at each step is to en- 
sure that the senders are "spatially separated". In 
many distributed MST algorithms, e.g., the algo- 
rithms of [?, 11], there is no guarantee that senders 
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are well separated. However, the Nearest Neighbor 
Tree (NNT) scheme [?, ?, ?] has a nice property that 
nodes gradually increase their range, and the nodes 
which are active in any round are sufficiently well 
separated. However, this approach has only been 
studied in CONGEST model in [?,?,?], and we adapt 
it to the SINR model. For illustration, first imagine 
that the maximum power P m ax is "unbounded" so 
that the d max < r max . The algorithm in this special 
case involves the following steps. 

(1) We have log r max < fi phases, ranging from i = 
1, . . . ,logr max . 

(2) In the zth phase, a subset Si of nodes participate, 
and the edges chosen so far form a forest rooted 
at nodes in Si. The nodes in Si transmit at power 
level of c • df for a constant c, where di = 2\ 

(3) In the ith phase, each node v G Si runs the NNT 
scheme: v connects to a "close-by" node in Si 
within distance d • di of higher rank, if one exists, 
for a constant c' . The nodes which are not able 
to connect continue into phase i + 1. 

We need to prove that the above steps can be im- 
plemented in the SINR model, and the resulting im- 
plicit tree is a low cost tree, relative to the MST 
One complication, in contrast to the original NNT 
scheme, is that there is no way to ensure that a node 
v € Si connects to the "closest" node in Si of higher 
rank, within distance dj. Because of the SINR model, 
it is possible that a transmission from some far away 
node could be received by v. We show that the prob- 
ability of this event is very low, so that with high 
probability, we get a tree that is "close to" the NNT, 
leading to the logarithmic approximation bound for 
the cost of the tree. We describe the above algorithm 
as subroutine NNT-SINR-BP in Section 3.1, with 
the modification that P max is bounded, and instead 
of \i phases, we only have log r max phases. The re- 
sult of NNT-SINR-BP is a forest with the highest 
rank nodes forming the roots. 

In the general case, P max is bounded, so that 
d m ax > Tmax, and all we have is that the graph 
induced by range r max is connected*. If we run 
the above bottom-up algorithm with some maximum 
range r\, we would get a forest in which the roots are 
at least c • r\ apart, where c is a constant. In order to 

'Recall that in Section 2, we in fact assume something 
stronger: the graph induced by a range of r max /c is connected 
for a constant c. 



connect up the roots, we first use a "top-down" ap- 
proach where we choose a set Dom of nodes so that 
(i) each node v Dom is within distance n of some 
node in Dom, and (ii) for each node u £ Dom, there 
are a "small" number of nodes within distance r\. 
The idea is that if we can construct a spanning tree 
on Dom (in the SINR model), and can ensure that 
each node in V \ Dom has a higher rank neighbor in 
Dom, within distance r±, then the above bottom-up 
phase restricted to V \ Dom will allow the trees to 
be combined. Such a set Dom is precisely a "con- 
stant density dominating set", as defined and com- 
puted efficiently in [31]. The spanning tree on Dom 
is constructed by adapting the UDG-NNT algorithm 
of [?], which starts with a node s and spreads the 
ranks from it. This adaptation is discussed below as 
subroutine NNT-S INR-CD, which also works in the 
SINR model because the nodes in Dom are spatially 
well separated. 

We first describe the two subroutines, and then dis- 
cuss the main algorithm. 

3.1 Constructing forests with power con- 
straints 

For subroutine NNT-SINR-BP, we are given a set 
S, a maximum power level P max (corresponding to 
a range r max ), and a rank function rank(-), which 
assigns unique ranks for all nodes in S. The goal 
is to construct a forest, in which each node connects 
to a parent within range r max . As discussed earlier, 
NNT-SINR-BP uses the NNT approach of [?, ?, ?], 
in which each node connects to the nearest node of 
higher rank, which leads to a forest. However, in 
order to be feasible in the SINR model, we need to 
do this in a careful and staged manner, so that the set 
of transmitting nodes in each round form a constant 
density set. 

Lemma 3.1. At the beginning of each phase i, for 
each node v 6 Si, there are at most a constant 
number of active nodes in the ball B(v,di), i.e., 
\B(v, d{) P) <Si| = 0(1). Further, for each node 
v £ SiJhere are no other active nodes in the ball 
B{v, di/2), i.e., B{v, di/2) f| # = {v}. 

Proof by induction. Recall that Si is the set of active 
nodes at phase i. For phase I, Si = S and do = 
1; obviously there can be at most 16 nodes in the 
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ball. Assume that for phase i, the statement is true. 
Then, at the beginning of phase i + 1, if v is active, 
according to the algorithm, all nodes in the range of 
di of v should all have a lower rank and thus are all 
inactive. Therefore, the distance between any two 
nodes in Si+i is at least di. By packing property, 
there can be at most 16 nodes in the ball B(v, di+i). 
Therefore, the statement is true for eveiy phase. □ 



Algorithm 1: NNT- S INR-BP (S, P ma x ,rank(-)) 
Input : node set S, max power P ma x, and 

value rank(v) for each v G S 
Output: set F of edges 

1 Each node v G S does the following: 

2 fmax ^ (Pmax / C) ^ > ^max 4 L^§2 ^max\ > 

3 initially S% 0,Vi > 1; # S< denotes the set 

of active nodes in phase i 

4 5i <- S; 

5 for i = 1 to i m ax do # in each phase 
# v broadcasts its rank to all nodes 

in Si within range di 

6 di <- 2\ Pi <- c'df; # d is a constant 

7 if V G Si then # broadcast rank value 

v broadcasts its rank by participating in 

LocalBroadcast(5j, Si,di, Pi); 

9 end 

10 S v ^— set of nodes with ranks received by v; 
v' <— the node with highest rank in S v ; 

if V = v' then # v has highest rank 
locally: add v to the active set Sj+i 
for next phase 

add v to Si+xl 



V ; # i! is done 
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else par(v) 
is end 

16 Ff- {(u,par(u)) : par(v) / 0, G 5}; 



Lemma 3.2. /« the end of the algorithm, the set F 
forms a forest. Further, the set Si max forms the set of 
roots of F, and for any two nodes u, v G Si max , we 
have d(u,v) > r max /2. 

Proof. Because of the NNT property, in which each 
node only connects to a higher rank parent, there are 
no cycles. Next, by design, the nodes in Si max do 
not connect to any other node, and form the roots of 
the forest F. By Lemma 3.1, it follows that for any 
u,v £ S imax , we have d(u, v) > r max /2. □ 



Lemma 3.3. The cost of the forest F is at most O(p) 
times that of of the minimum spanning forest of the 
nodes in F. Further, NNT-SINR-BP is feasible in 
the SINR model, with running time of O(tilogn), 
w.h.p. 

Proof For i < i max , let F» = {(v,w) G F : v G 
Si,w = par(v) G Si + {\ denote the set of edges in 
F between nodes in sets Si and Si+i. Recall that Si 
denotes the set of nodes which remain active in the 
beginning of phase i. Let MST(Si) denote a mini- 
mum spanning tree of only the nodes in Si. W.l.o.g., 
we assume |5^| > 1. For any pair of nodes in Si, the 
mutual distance is at least di/2 (from Lemma 3.1), 
it follows that cost(MST(Si)) > (\Si\ - 1)^/2 > 
\Si\di/4. Next, we have cost(Fi) < di\Fi\ < di\Si\, 
implying cost(Fi) < 4cost(MST(Si)). Since there 
are 0(\ogr max ) phases and cost(MST(Si)) < 
cost(MST), we have cost(F) = £V cost(Fi) < 
4(logr max .) Yli cost(MST(Si)). We have assumed 
r ma x < d max ; therefore, \ogr max < fi, implying the 
statement of approximation ratio. By Lemmas 2.1 
and 3.1, each run of the local broadcast primitive fin- 
ishes in 0(log n) time, so does each phase of the al- 
gorithm. The statement of running time follows. □ 
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Algorithm 2: NNT-SINR-CD^, s, r) 
Input : constant density node set S, sink s, 
range r 

Output: spanning tree T on S 

1 Each node v € S does the following: 

# in phase : 

2 if V = S then # s broadcasts to neighbors 

v generates a large rank(v), and broadcasts 
it by LocalBroadcast({s}, S, r, cr a ); 

4 else v listens; 

# in each phase i: 

5 fori = lto2D(G r {S))do 

6 Si <- 0; 

7 if u receives a message m = (id', rank') 
then 

if m is the first message v ever receives 
then 

v generates rank(v) randomly such 
that rank(v) < rank'; 
Si^ SiU {v}; 
v participates in 

LocalBroadcast(Si, S, r, cr a ); 

end 

add id' to list L(v), if rank' > rank(v); 

end 
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n 

12 
13 
14 

is end 



16 par(v) <— one of the nodes in L(v); 

17 return the set {(v,par(v)) : v ^ s} of edges 



3.2 Spanning trees for instances with con- 
stant density 

We now consider the problem of constructing a low 
cost spanning tree for a constant density instance. 
Subroutine NNT-SINR-CD takes such a set S, a 
sink s and a range r as input. As defined in Sec- 
tion 2, G r (S) denotes the S'-induced graph based on 
range r. We adapt the UDG-NNT algorithm of [?] 
for producing a spanning tree. Let D(G r (S)) de- 
note the diameter of G r (S); we assume (an estimate 
of) D(G r (S)) is known to all the nodes. We note 
that in Line 1 1 of the algorithm, the local broadcasts 
are run simultaneously for all nodes v, which get the 
rank message for the first time. Further, each itera- 
tion of the for loop in Lines 6-14 involves 0(log n) 
time steps, in which the local broadcast is run; nodes 
v which do not have to send their ranks in some iter- 
ation remain silent during these steps. 

Lemma 3.4. If G r (S) is connected, and S 
is a constant density set with range r, Al- 
gorithm NNT-SINR-CD produces under 
the SINR model a spanning tree T\ on S 
with cost(T{) = 0(OPT(G r (S))) in time 
0(D(G r (S))logn) with high probability, where 
OPT(G r (S)) = cost{MST(G r (S))). 

Proof. Our proof mimics the proof of algorithm 
UDG-NNT from [?]. Since G r (S) is connected, 
each node v ^ s is able to run line 12, and the 
tree T\ is constructed. The simultaneous calls to 
LocalBroadcast all take time O(logn) in each 
round (with high probability), so that the overall 
running time is O (D (G r (S)) log n), w.h.p.. Since 
the LocalBroadcast algorithm is feasible in the 
SINR model, NNT-SINR-CD is also feasible. 

The constant density property of S implies that 
any ball B(v,r') with r' < r has 0(1) nodes in S. 
Therefore, cost(MST(G r (S))) = Sl(\S\r). Next, 
the local broadcasts (Lemma 2.1) ensure that each 
node v receives messages from nodes within distance 
c'r for a constant d . Therefore, cost(T\) < \S\c'r, 
and the lemma follows. □ 



3.3 Putting everything together: Algorithm 
MST-SINR 

Our high level idea is to start with a constant 
density dominating set Dom, and run subroutine 
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NNT-SINR-CD to construct a tree T\ on Dom. We 
then choose and disseminate ranks suitably, and run 
NNT-SINR-BP to form a forest, which get con- 
nected to the nodes in Dom, and together form a 
spanning tree. 

Our analysis, at a high level, involves the follow- 
ing steps: we first show that T\ is a spanning tree on 
Dom with range 2r max /c, and has low cost. Next, 
we show that Algorithm NNT-SINR-BP with the 
range of r max results in a forest on V \ Dom, each 
of whose components gets connected to some node 
in Dom, because of the way the ranks are chosen. 
Finally, we show that the combined tree produced in 
this manner has low cost. 

Lemma 3.5. The graph G 2rmax / c (Dom) induced by 
Domw.r.t. a range of 2r max /c is connected, and has 
a diameter of at most 2D. 

Proof. Suppose G2 rmax / c (Dom) is not connected. 
Then there exists a partition Dom = Dom\ U 
Dom2, which induce disconnected components, 
and mm {uv)eDomixDom2 {d(u,v)} > 2r max /c, 
since the dominating set Dom constructed uses 
range r max /c. That implies that G is connected 
with r rnax /c, which is a contradiction. Therefore, 
G2r max / c {Dom) is connected. □ 

Lemma 3.6. Algorithm MST-SINR produces a 
spanning tree of cost 0(/u) times the optimal in time 
0(D log n + p,\ogn), with high probability, in the 
SINR model. 

Proof. From Lemma 3.4, the call to NNT-S INR-CD 
produces a spanning tree T\ on set Dom. Next, con- 
sider the call to NNT-SINR-BP in Line 14 of the 
algorithm. Let V' = {v G V \ Dom : par(v) = cf)} 
be the set of nodes for whom the parent is not defined 
during this call. Each nodes in V corresponds to a 
tree root of the forest constructed in this call, i.e., the 
edges F\ = {{v,par{v)) : v £ V \ Dom, par (v) ^ 
4>} (constructed during this call) form a forest, rooted 
at nodes in V. In Line 12, each node in V \ Dom 
registers its dominating node in T\ \ in Line 17, each 
node v G V' uses this information to connect to T\. 
Therefore, T\ U F x U {(v,par(v)) : \/v G V'} is a 
spanning tree. 

From Lemma 3.4, we have cost{T\) = 
0{OPT). Next, from Lemma 3.3, it follows that 



Algorithm 3: MST-SINR(y) 



Input : node set V 
Output: spanning tree T 

# use the algorithm of [31] to construct 
a constant density dominating set Dom 
w.r.t. range r max /c, for a constant c 

1 Dom <— ConstDominatingSetfV, r max /c) 
for a constant c; 

# we adapt the UDG-NNT algorithm of [?] 
for constructing a spanning tree on 
Dom 

2 s <— a node in Dom; # si nk node 

3 T\ <- NNT-SINR-CD(Dom,s,2r moa; /c); 

4 for each v G Dom do 

s | rank(v) <— the rank chosen in this process; 

6 end 

# each v £ Dom broadcasts its rank to 
nodes within its range 

7 run 

LocalBroadcast(-Dom, V \ Dom, r max /c) 
such that all v G Dom to broadcast rank(v); 

8 for each v G V \ Dom do 

9 
10 
11 

12 



b(v) <— largest rank received by v; 
q(v) <— id of the node that sent v rank b(v); 
v chooses rank(v) uniformly at random 
such that rank(v) < b(v); 

dom(v) q(v)\ # to connect to tree 
roots of forest Fi constructed below 

13 end 

# use the ranks chosen above to build a 
forest with tree roots in Dom to 
connect up with the tree T\ 

14 Fi <r- 

NNT-SINR-BP(V \ Dom, P max ,rank(-)); 
is for each v G V \ Dom do 

16 if par(v) = then # v is a tree root in 
forest Fi 

17 par(v) dom(v); # connect to Ti 

18 end 

19 end 

20 return set {(v,par(v)) : par(v) ^ 0} of edges. 
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cost(Fi) = 0(n ■ OPT). Finally, for each 
v G V', d(v,par(v)) < c'r max /c. Therefore, 
SueV" d(v,par(v)) = 0(OPT), which implies 
the cost of the tree produced by NNT-SINR-BP is 
0{n ■ OPT). 

Finally, we analyze the running time. The call to 
ConstDominatingSet in Line 1 takes O(logn) 
time. From Lemma 3.4, the call to NNT-S INR-CD 
takes time 0(D log n), with high probability. Next, 
the local broadcast takes time 0(log n), and the call 
to NNT-SINR-BP takes time O(filogn). Putting 
all of these together, the total running time is 
0(Dlogn + /xlogn), with high probability. From 
[31], and Lemmas 3.4, 3.3 and 2.1, all the com- 
putations of the algorithm are feasible in the SINR 
model. □ 

3.4 Scheduling complexity of the spanning 
tree 

The scheduling complexity (as defined in [29]) of a 
set of communication requests is the minimal num- 
ber t of time slots during which each link request 
has made at least one successful transmission. Due 
to the constant factor approximation algorithm by 
Kesselheim [?] for finding a maximum feasible set 
of links, any spanning tree has a scheduling com- 
plexity of O(logn) under the SINR model. Here, 
we use the term distributed scheduling complexity to 
describe the scheduling complexity for a set of links 
that needs to make transmission decisions in a dis- 
tributed fashion. The spanning tree T produced by 
our algorithm has the following property. 

Lemma 3.7. The spanning tree T produced by Algo- 
rithm MST-SINR has a distributed scheduling com- 
plexity of O(plogn), regardless of the direction of 
the transmission requests on the edges. 

Proof by construction. Let the edges in T be ori- 
ented in arbitrary directions, so that we obtain a set 
L of links, which represent a set of transmission re- 
quests. 

We construct a feasible schedule which can be im- 
plemented in a distributed fashion. We partition the 
links into length classes 1,2,..., log r max , such that 
all links in length class i have a length between 2 l_1 
and 2\ In each phase i € [1, log r max /c], links 
in length class i transmit with probability l/K in 



each time step, where K is an upper-bound of the 
number of edges/links in length class i with an end 
node within range 2' +1 of each node. With the same 
proof approach as that in [15], we can prove that in 
O(Klogn) time steps, all the links in length class i 
have made successful transmissions with high prob- 
ability. This way, in log r max /c] phases, all the links 
have made successful transmissions, w.h.p. 

Now, we argue that K is O(l) in each phase, and 
thus the total schedule length is O(filogn). Due 
to the construction of T in Algorithm MST-SINR, 
since Dom is already a constant density node set, 
we only need to analyze the edges formed during the 
call to NNT-SINR-BP<y \ Dom, P max ,rank(-)) 
in Line 14 of Algorithm MST-SINR. In each phase 
of the NNT-SINR-BP, the lengths of edges in- 
creases upon the previous phase. In one phase dur- 
ing NNT-SINR-BP, the newly formed edges corre- 
sponds to a length class, and the nodes involved in 
that phase form a constant density node set, accord- 
ing to Lemma 3.1. That implies that the number of 
edges in length class i with an end node within range 
2 i+1 of each node is 0(1). 

In terms of the distributed implementation of 
scheduling, each edge may remember the phase 
number in NNT-S INR-BP when they are connected 
up by a new edge, and participate in the phase of 
the same phase number of our scheduling construc- 
tion. □ 

4 Conclusion 

In this paper, we describe the first distributed algo- 
rithm in the SINR model for approximate minimum 
spanning tree construction. This is the first such re- 
sult for solving a "global" problem in the emerging 
SINR based distributed computing model — this is 
in contrast to "local" problems, such as independent 
sets and scheduling, for which distributed algorithms 
are known in this model. Our algorithm produces 
a logarithmic approximation to the MST, and takes 
time 0(D log n + //logn), featuring a distributed 
scheduling complexity of O(nlogn). Our main 
technical contribution is the use of nearest neigh- 
bor trees, which naturally ensure spatial separation 
at each step, thereby allowing SINR constraints to be 
satisfied. 
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